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Abstract 



In this paper we initiate the study of minimizing power consumption in the broadcast scheduling 
model. In this setting there is a wireless transmitter. Over time requests arrive at the transmitter for 
*vj ' pages of information. Multiple requests may be for the same page. When a page is transmitted, all 

requests for that page receive the transmission simulteneously. The speed the transmitter sends data 
at can be dynamically scaled to conserve energy. We consider the problem of minimizing flow time 
plus energy, the most popular scheduling metric considered in the standard scheduling model when the 
scheduler is energy aware. We will assume that the power consumed is modeled by an arbitrary convex 
function. For this problem there is a fl(n) lower bound. Due to the lower bound, we consider the 
resource augmentation model of Gupta et al. 1281 . Using resource augmentation, we give a scalable 
algorithm. Our result also gives a scalable non-clairvoyant algorithm for minimizing weighted flow time 
plus energy in the standard scheduling model. 
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1 Introduction 

Wireless transmitters can typically be utilized in a variety of ways. This presents the designer with tradeoffs 
between power, signal range, bandwidth, cost, ect. In this paper we consider the trade off between power 
and performance. Reducing the energy consumption is of importance in many systems. For example, in ad 
hoc wireless networks a typical transmitter is battery operated and, therefore, energy conservation is critical. 
There are two modes of a wireless transmitter where power can be saved. (1) During idle times (2) During 
transmit times. Work on the first mode focuses on putting the system to sleep when not in use lfT71 l21. By 
appropriately putting the system to sleep, energy consumption can be drastically reduced. Another way 
the transmitter can reduce power is scaling the speed of the wireless transmission J3T1 . By using more 
power a signal can be sent at a faster rate or, to save power, the signal can be sent at a slower rate. We 
focus on reducing energy in the second mode. Reducing energy by changing the transmission speed is 
naturally related to the well studied model of speed scaling in scheduling theory. However, now that we are 
in interested in wireless networks, this motivates a generalization of the standard speed scaling model. 

Companies such as IBM and AMD are producing processors whose speed can be dynamically scaled by 
the operating system. Now an operating systems can control the power consumed in the system by scaling 
the processor speed. To model this is scheduling theory, it is assumed that there is a power function P where 
P(s) is the power use when running the processor at speed s. A scheduler now not only chooses the job to 
schedule, but also the speed used to process the job. In other words, a scheduler determines a job selection 
policy and an energy policy. This is known as the speed scaling scheduling model. 

The standard speed scaling model is similar to our setting, except that we are interested in speed scaling 
in wireless networks. Like the speed scaling setting, we assume there is a power function P where P(s) is 
the power used when transmitting at speed s. To model the wireless network, we will assume a well-studied 
model known as the broadcast model. In a broadcast network, there is a single server and n pages of in- 
formation are stored at the server. Over time clients send requests for pages to the server. Multiple clients 
could be interested in the same page of information. When the sever broadcasts a page p, all outstanding 
requests for that page are simultaneously satisfied. This is how the broadcast model differs from the stan- 
dard scheduling model. In the standard model, a each request (job) is for a unique page and broadcasting 
(processing) a page satisfies exactly one request. In the online setting, the server is not aware of a request 
until it arrives to the system. The broadcast model is motivated by applications in multicast systems and 
in wireless and LAN networks |[38l [Tl[3ll29l. Along with practical interest, the broadcast model has been 
studied extensively in scheduling literature |[T0l [3l[Tl [TTl[29ll26l . Similar models have been considered in 
queuing theory and the stochastic scheduling model lfl9l[T8l,l36ll37l . 

The goal of the scheduler is to satisfy the requests in an order which optimizes a quality of service met- 
ric. Naturally, this metric depends on the needs of the system. When speed scaling is allowed, the server 
has a dual objective. One is to minimize the power usage and the other is to optimize the quality of service 
the clients receive. The most popular metric in speed scaling literature is minimizing a linear combination 
of flow time and total energy |4l[9l[l2l|28l|5]]. The flow timqj of a request is the amount of time the server 
takes to satisfy the request. Let F denote the total flow time of a schedule over all requests and let E denote 
the energy consumption of the schedule. The scheduler focuses on minimizing G = F + E. This has a 
natural interpretation. Say the system is willing to spend one unit of energy to reduce p units of flow time. 
For example, a system designer may be able to justify spending 1 erg of energy to decrease the flow time 
by p = 10 micro seconds. The system designer would then desire a schedule that minimizes F + WE. By 
scaling the units of energy and flow time, we can assume that p = 1. The main contribution of this work is 
to initiate the study of energy in the broadcast model. We focus on the the problem of minimizing flow time 
plus energy in the broadcast setting. Besides practical interest in the model, we believe this problem is of 



Flow time is also known as waiting time or response time 



theoretical interest as it is a natural extension of previous work. 

Results: In this paper we give an algorithm for minimizing total flow time plus energy in the online broad- 
cast setting where the power function is an arbitrary convex function. There is a U(n) lower bound on the 
problem of minimizing flow time in broadcast scheduling when all broadcasts are sent at a fixed rate and 
energy is not included in the objective J33l . Since the power function is arbitrary, this lower bound also 
extends to the problem of minimizing flow time plus energy. Thus we will resort to resource augmentation 
lf32l . The resource augmentation model we consider is the same as that introduced recently in [28]. Here, if 
our algorithm is given 1 + e speed advantage over the adversary, then our algorithm uses power P(-y) when 
broadcasting at a rate of (1 + e)j. The rest of the paper will be devoted to proving the following theorem. 

Theorem 1.1. There exists an algorithm that with 1 + e resource augmentation is G ^(^-competitive for 
minimizing total flow time plus energy in broadcast scheduling with an arbitrary power function where 
< e < 1. 

Our algorithm is called scalable since the minimum amount of resource augmentation is used to be 
0(l)-competitive. Given the strong lower bound, this is the best result that can be shown in the worst case 
setting up to a constant in the competitive ratio. Broadcast scheduling is a generalization of the standard 
scheduling model. The standard model can be interpreted as each request being for a unique page. In fact, 
energy plus flow time in the broadcast scheduling strictly generalizes weighted flow time plus energy in 
the standard speed scaling model. Thus, our result also gives a scalable algorithm for minimizing energy 
plus weighted flow time in standard scheduling setting. In the standard model, an algorithm is said to be 
non-clairvoyant if it is online and does not know the processing time of a job. The algorithm we introduce 
is non-clairvoyant when interpreted in standard scheduling model. 

Theorem 1.2. There exists a non-clairvoyant algorithm that with 1 + e resource augmentation is 0(\)- 
competitive for weighted flow time plus energy in the standard speed scaling model. 

Before this work, no non-trivial non-clairvoyant algorithm was known for (un)weighted flow time plus 
energy when the power function can be arbitrary in the standard speed scaling setting. However, it was 
known that no non-clairvoyant algorithm can be 0(l)-competitive for this problem lfT2l . Thus, resource 
augmentation is necessary for an algorithm to be 0(l)-competitive. 

Relation to Previous work: In the standard scheduling setting, speed scaling has traditionally used power 
functions of the form P(s) = s a for some a > 0. These power functions were motivated by the fact that 
the power used by CMOS based processors is approximately s 3 . Recently, modeling the power function 
as an arbitrary convex was suggested to capture the power consumption of more complex systems (8]. As 
mentioned, in this work, we adopt this new general model. 

There is a large amount of literature on scheduling in the broadcast model. The most popular schedul- 
ing metric considered is minimizing average flow time. Recently, it was shown that there exists an online 
scheduler that is a scalable algorithm for minimizing the total flow time of a schedule when broadcasts are 
sent at a fixed rate and energy is not considered in the objective 11301 181. This work builds on the ideas and 
techniques given in [8 ]. In particular, this paper uses the algorithm of (H with the addition feature of being 
power aware. Since the power function considered in this paper is an arbitrary convex function, our work 
generalizes these results. 

Previous work on Flow-time in Broadcast Scheduling: Minimizing flow time (without energy) in broad- 
cast scheduling where broadcasts are sent at a fixed speed has a rich history beginning with the seminal work 



of |[33l[TTi . In ||33"1 . Kalyanasundaram et al. showed that no online deterministic algorithm can be fl(n)- 
competitive. This has since been extended to show that no randomized online algorithm can be Q(y/n)- 
competitive [6]. Due to these strong lower bounds, most previous work has focused on analyzing algorithms 
in a resource augmentation model 11321 . In this resource augmentation analysis, the online algorithm is given 
s-speed and is compared to a 1-speed adversary. An algorithm is said to be s-speed c-competitive for some 
objective function if the algorithm's objective when given s speed is within a c factor of the optimal so- 
lution's objective schedule given 1-speed for every request sequence. An algorithm that is (1 + e)-speed 
0(l)-competitive is called scalable where < e < 1 since it is 0(l)-competitive when given the minimum 
advantage over the adversary. 

In the offline setting, the problem was shown to be NP-Hard |[25l fl4ll . The first 0(l)-speed O(l)- 
approximation was found in ll33l . The best positive result in the offline setting using resource augmen- 
tation is a (1 + e)-speed 0(l)-approximation |6j. Without resource augmentation, (6[ gave a 0(y/n)- 
approximation. This has since been improved to a 0(log 2 nj log log n) approximation Q. It is long stand- 
ing open question whether or not this problem admits a 0(l)-approximation. 

It has been difficult to find competitive online algorithms for broadcast scheduling. The first online 
algorithm was given by Edmonds and Pruhs in ll22l . This algorithm was shown to be (4 + e)-speed 0(1)- 
competitive algorithm via a reduction to a different scheduling problem known as arbitrary speed up curves. 
This reduction takes a s-speed c-competitive algorithm for the speed up curves setting and converts it into 
a (2s)-speed c-competitive algorithm for the broadcast setting. In [23] Edmonds and Pruhs showed that a 
natural algorithm Longest-Wait-Fist (LWF) is 6-speed (9(l)-competitive via a global charging argument. 
The analysis of LWF has been improved to show that the algorithm is (3.4 + e)-speed 0(l)-competitive 

H9. 

Later, Edmonds and Pruhs [24] gave an algorithm LAPS that is scalable in the arbitrary speed up curves 
setting. Using the reduction 11221 this gives a (2 + e)-speed 0(l)-competitive algorithm for the broadcast 
setting. It was a long standing problem whether or not there existed a scalable online algorithm in the broad- 
cast model. This question was resolved by Im and Moseley and Bansal et al. by finding scalable algorithms 
II301 I81. The reduction of ll22l was improved by Bansal et al. (H to show that a s-speed c-competitive algo- 
rithm for the speed up curves setting can be converted into a s(l + e)-speed 0(|)-competitive algorithm for 
the broadcast setting. This shows that BLAPS, the broadcast version of LAPS, is a scalable algorithm in 
the broadcast model. Besides improving the reduction, in (H the algorithm BLAPS was analyzed directly 
via a potential function analysis. This analysis has since been extended to show an algorithm that is (2 + e)- 
speed 0(l)-competitive for the ^-norm of flow time [27]. Recently, a scalable algorithm was found for the 
£k -norms of flow time for all k > IJ2TTI . 

Previous work on Speed-Scalaing: All of the previous work on broadcast scheduling has assumed that 
the scheduler broadcasts at some fixed speed. Although speed scaling has not been considered in broadcast 
scheduling, it has been considered extensively in the standard scheduling model. As mentioned, the standard 
scheduling model can be interpreted as each request being for a unique page. Recall that is the speed scaling 
setting, a power function P is given where P(s) is the power used when running the processor at speed s. 
We first consider the traditional model where P(s) = s a and a > 1. In ll35l an efficient algorithm was 
given for the problem of minimizing flow time offline subject to a bound on the amount of power consumed. 
This algorithm can be extended to find a schedule that minimizes the flow time plus energy in the offline 
setting. The problem of minimizing flow time plus energy online was first studied by EJ. showed that 
the algorithm that runs jobs with power proportional to the number of outstanding requests is 4-competitive 
for unit work jobs. They also showed the Highest-Density-First algorithm coupled with this power strategy 
is 0( . a z ) competitive for weighted flow plus energy. This is since been improved in 11341 for unweighted 

flow plus energy to give a O(y^j) -competitive algorithm. In lfi"2l a O (a 3 ) -competitive non-clairvoyant 



algorithm was given. Chan et al. in lfT3l gave a 0(logm)-competitive algorithm for the speed-up curves 
setting when there is m processors. 

Recently, @ introduced the problem of minimizing flow time plus energy with an arbitrary convex 
power function. Surprisingly, they were able to give an algorithm that is 3-competitive in this general setting 
for flow plus energy. This resolved a central question on whether an algorithm could be 0(l)-competitive 
where the competitive ratio does not depend on a. Gupta et al. gave a scalable algorithm for minimizing 
weighted flow time plus energy in the case where there are m machines, each machine may have a different 
power function, and each power function is an arbitrary convex function ll28l . As mentioned, in this paper 
we adopt the assumption that the power function is an arbitrary convex function. 



2 Preliminaries 

We begin by introducing some notation. There are n pages stored at the server. Each page has a size a p . A 
request for page p is satisfied if it receives a p pieces of page p in sequential order. That is, a request receives 
the transmission of page p from start to finish in that order. This will be further elaborated on later. Notice 
that by this definition, multiple requests can be satisfied by a single broadcast. Let J P: i denote the ith request 
for page p. Let a Pt i be the time this request arrives to the server. In the online setting, this is the first time 
the server is aware of the request. Let f p> i be the time the server satisfies request J p j. The total flow time of 
a schedule is F = ^ Yli(fp,i ~ a p,i)- The following definitions will be useful. 

Definition 2.1 (convex function). A real valued function f is convex if and only if for any < a < 1 and 
any real valued x and y it is the case that f(ax + (1 — a)y) < af(x) + (1 — a)f(y) 

Definition 2.2 (concave function). A real valued function f is concave if and only if for any < a < 1 and 
any real valued x and y it is the case that f(ax + (1 — a)y) > af(x) + (1 — a)f(y) 

Using the definition of a concave function, we can easily show the following proposition. 

Proposition 2.3. For any real valued concave function f where /(0) = 0, the following holds 

• For any positive real number x, ^E- < Z \. 

• For any positive real number x and any a < 1, oef(x) < f(ax) 

We will be given a power function P : R — > E. The value of P(s) is the power used when the server 
broadcasts with speed s. In (5j |28[ with a small loss in the competitive ratio, it was assumed that power 
function P satisfies the following conditions, 

1 . At all speeds, P is defined, continuous and differentiable 

2. P(0) = 

3. P is strictly increasing 

4. P is strictly convex 

5. P is unbounded 

In this paper we adopt these assumptions on P. Notice that the speed could be bounded. If the algorithm 
is given (1 + e) resource augmentation and the maximum speed is s max then our algorithm's maximum 
speed is (1 + e)s max . Let the function Q = P^ 1 . That is, Q{y) is the maximum speed of the processor 
with a limit of y on the power used. Notice that Q(0) =0 and that Q is concave. Let s(t) denote the speed 
used at time t. Let E = f™ Q P(s(t))dt be the total energy used. The goal of the scheduler is to minimize 
G = F + E. 



2.1 Fractional Broadcast Scheduling 

In integral broadcast scheduling (the standard model), a request J pi for page p is satisfied if it receives each 
of a p pieces of page p in sequential order. That is, a page is divided into pieces. At any time the scheduler 
decides which piece to broadcast. A request is satisfied if it receives each of the pieces from start to finish, in 
that order. We now define a different version of the broadcast scheduling problem called fractional broadcast 
scheduling. In this setting, the a p unit sized pieces of page p are indistinguishable. Now a request J p ^ is 
satisfied once the server devotes a total of a p time units to page p after time a Pt i. In (H it was shown that an 
online algorithm running at a fixed speed that satisfies a request J Pt i by time t in a fractional schedule can 
be converted into a different online algorithm that completes J p ^ by time t + 4(i — a p ^) + 4 in an integral 
broadcast schedule. Here, the new algorithm is given an additional e' resource augmentation. 

It is not obvious that this generalizes when the server can use varying speeds over time. In Appendix lAl 
we extend their result to the speed scaling setting. We prove the following theorem. This theorem may be 
of independent interest, since it can be used to reduce integral broadcast scheduling to the fractional setting 
for a variety of objective functions where speeds can vary over time. 

Theorem 2.4. Let S be a fractional broadcast schedule where the server can vary its speed over time and 
let < e' < 1. The schedule S can be converted into a schedule S' using e' resource augmentation such 
that the schedule S' satisfies the following properties 

• The power used by S' is at most the power used by S. 

• If a request J p i is fractionally satisfied at time f Pt i under S then this request is integrally satisfied at 
time f Pj i + p(/ p ,i — a Pj i) + 4 under the schedule S'. 

• If the algorithm that generates S is online then so is the algorithm that generates S'. 

This theorem implies the following corollary. 

Corollary 2.5. An algorithm with 1 + e resource augmentation that is c-competitive for flow time plus 
energy in fractional broadcast scheduling can be converted into an algorithm that is -^-competitive for 
integral broadcast scheduling and uses (1 + e)(l + e') resource augmentation where < e' < 1. 

Due to the previous corollary, we will focus on fractional broadcast scheduling for the rest of this paper. 
It is important to note that if each request is for a unique page, the fractional broadcast setting is equivalent 
to the standing scheduling setting. This reduction is not needed for the proof of Theorem |1.2| 

2.2 The Algorithm 

Let N a (t) contain the unsatisfied requests under our algorithm's schedule at time t. Our algorithm broadcasts 
at speed Q(\N a (t)\) at time t. Intuitively, since the flow time of the schedule at time t increases by |iV (i)|, 
the scheduler can afford to use this much power at time t. Now we define how our algorithm distributes 
its processing power. Here the algorithm BLAPS is used. This algorithm was introduced in Il24l l8l. We 
will be assuming that our algorithm is given (1 + 6e) resource augmentation and this implies the total speed 
our algorithm uses at time i is (1 + 6e)Q(\N a (t)\) where e < ~. The algorithm BLAPS takes a parameter 
(3 < 1. We will fix (3 = e later. Let N^(t) be the f3\N a (t)\ requests in N a (t) with the latest arrival times. 
At any time t, the algorithm equally distributes its processing power amongst the requests in N' a (t). That 
is, for a request J Pj j G A^(i) page p is broadcasted at a rate of x p ^(t) = - — r \ n (t) \ • Notice that there 
could be multiple requests for page p in N' a (t). Let S p (t) be the set of requests for page p in N^(t). A page 
p is broadcasted at a rate of J2j <=s ft) x p,«(*) at ti me *• We call x P; i(t) the amount J Pt i contributes to how 
much page p is broadcasted. Notice that at any time, the algorithm uses total speed (1 + 6e)Q(\N a (t)). 



For brevity, throughout this paper will will be using the following simplifying assumption. We assume 
that at each time t, /3\N a (t)\ is integral. A simple extension of the analysis can be made if this is not the case 
with the following elaboration. Let N' a (t) now be the [/3|iV a (t)] requests in N a (t) with latest arrival times. 
For the |_/3|iV a (£)|J requests with latest arrival times in N' a (t), BLAPS keeps the value of x the same. Let J p j 
be the only other request in N^t). For this request, we set x Pji = ((3\N a (t)\ - [fi\N a {t) |J ) (1+ ^ffii (t)l) • 
That is, J P: i is given processing power proportional to (/3\N a (t)\ — [l3\N a (t)\\ ), the amount J p ^ 'overlaps' 
the (3\N a (t)\ latest arriving requests. 

3 Analysis 

We will be using a potential function argument EOl . Let 4zG(t) be the increase in our algorithm's objective 
at timet. Likewise, let ^G*(t) be the increase in OPT 's objective at time t. Notice that ^G(t) = 2\N a (t)\ 
because there are |iV (t)| outstanding requests at time t, which increases the flow time of the schedule by 
\N a (t)\ and the scheduler uses power \N a (t)\ at time t. Let G = J^° -^G(t)dt denote our algorithm's total 
cost and let G* = J °° ^-G*(i)dt denote the optimal solution's total cost. We will define a potential function 
<&(£) that will satisfy the following conditions: 

1. Boundary Condition: $ is before any request arrives and after all requests complete 

2. Arrival/Completion Condition: $ does not increase when a request is completed by LAPS or OPT 
or when a request arrives. 

3. Running Condition: At all times t it is the case that ^G(t) + ^<£ < c4?G*(t) where c is some 
positive constant. 

By integrating the running condition over time, this is sufficient to show that our algorithm is c-competitive. 
This can be seen as follows. The second equality holds due to i>(0) = <J>(oo) = 0. 

r°° d r°° d d r°° d 

G = l di Gm = J ( dt; G(f) + dt (t))dt - 1 "^Wdt^cG* 

Now we define our potential function. We assume without loss of generality that all requests arrive at 
distinct times, which will simplify the definition of the potential function. For a request J p i G N a (t) let 
rank( J p j) = J2j eNJt) a <a t 1 at trme * be the number of jobs that have arrived during [0, a Pj i] that 
are unsatisfied by the algorithm. Recall that x Pi i(t) is the amount page p is broadcasted by our algorithm 
at time t due to J p> i and a p is the amount of page p that must be broadcasted after a p j to satisfy J p ^. Let 
On(p, i, t\ , £2) = J t 2 x P j(t)dt. Let y*(t) be the amount of page p that is broadcasted at time t by OPT and 

let Opt (p, h,t 2 ) = jll y*(t)dt. We define the variable z Pti (t) for a request J p>i to be gEfojj^gg^W) , 
Our potential function is, 

Our potential function combines the potential functions of JH and ll28l . The potential function of O was 
used for broadcast scheduling without energy, which needs to somehow have the power function reflected 
in the potential if it is to work in our setting. To incorporate the power function we use some ideas given in 

EH. 

As is usually the case, our potential is designed to approximate the algorithm future cost after subtracting 
the optimal solution's future cost. Our algorithm gives higher priority to jobs that have arrived recently. The 



potential function is designed to capture the the remaining cost of the algorithm if it satisfies requests in the 
opposite order of arrival. Fix a request J p ^. If the optimal solution satisfied request J p ,[, then the value of 
z P ,i(t) should be thought of as the remaining volume of page p that must be broadcasted to satisfy request 
J p> i. Assume J Pi i has the highest rank in N a (t). Then Q(rank( J P) i)) is the speed that will be used at time t. 
The value of rank( J p ^) is the number of requests waiting for request J p j to be satisfied. Thus, if requests 

are satisfied in opposite order of arrival, rank( J Pt i) ( n( ra nk( j ■)) ) * s an est i mate on tne A° w trme that 
will be accumulated while the algorithm satisfies J„ , because it will take at least 7^7 — ^K_ — time units to 
satisfy request J p j. 

3.1 Change of the Potential Function 

It is easy to see that the potential function is not affected when the optimal solutions completes a request. 
Also the potential function does not increase when a request J P) j arrives since z P) i = 0. Further, the potential 
is before all requests arrive and after all requests are completed. We now show that <I> does not increase 
when a request is satisfied by the algorithm. This lemmas, combined with the previous arguments shows 
that $ satisfies the boundary and arrival/completion conditions. 

Lemma 3.1. <J> does not increase when the algorithm completes a request. 

Proof. Consider a time t where that algorithm completes a request J Pi j. We can assume that rank( J Pj {) < 
\N a (t)\; otherwise, trivially there is no increase in 3>. The change in <£ is, 

Ac&ff) = - V / (rank(J p /j) -l) Zi _ rank(J pfJ )zi 

U e fa lQ(rank(J yj )-l) Q(r a nk(J p/J )) 

■J p i j t JV a (t),a p t j >a Pi i 

To show that A$(t) < 0, we need to show that 

\Na(t)\ \N a (t)\ 

^— «. (rank(jp'jj — \)Zi sr-^ rank(«/p' j)i 



QfrahkUw i) - 1) ~ f^ Q(i) 

J p i^N a (t),a pl] >a p ^^ y KP ^' ' J p >^N a (t),a plj >a Pil ^ W 

Y^ ( rank (J p / j ) - 1) ^ Q(rank(J p /j) 



< 



£ 



^-— ' rank(J„/ ~) ^-^ Q(rank(J„/ ,•)) 

J p/>j eN a (t),a p , tj >a p ,i V PJ/ J p , tJ eN a (t),a p , i:j >a Pyi ^ K V p '° JJ 



This is true by definition of Q and Proposition 12.31 □ 

Now we concentrate on the running condition, the final property of <5 that needs to be shown. Fix a time 
t. Let N (t) be the set of requests unsatisfied by OPT at time t. First lets consider the change in $(£) due 
to the algorithm's processing. Recall that the algorithm broadcasts page p at a rate of - — b\n (t) \ ^ ue to 
a request J pi G N' a {t). Further, notice that Opt (p, a p ^, t) > a p for any request J pi G N a (t) \ N (t), since 
OPT must broadcast <7p units of page p after a Pt i to satisfy page p. Therefore, for any J p> j G N^(t)\N (t) we 
have 5jZp,i(£) > ^On(p, i, t, 00) = -^w|p. Notice that the rank of each request the algorithm 
is currently working on is at least (1 — /3)N a (t). This is because N^(t) consists of the f3\N a (t)\ requests with 
latest arrival times. Using this, we can upperbound the change in <&(t) due to the algorithm's processing, 

U < -i E rank(J pi / 1+ ff5f>'><- V (! + *)(' ~« (1) 



Next, we consider the change in <£ due to the adversary's processing. Let p* be the page which the 
adversary broadcasts. Let s (t) be the speed the optimal solution processes page p* at. Let P*(t) = 
P(s (t)) be the power OPT uses at time t. The adversary can affect z p * ^ for any request J p *^ £ N a {t). We 
first observe the following, 

d v^ / % v~^ On(p*,i,t,od) /d , „ ,\ d . * , ... , 

— 2^ z p *,i(t)< 2^ — ~^~ — '-^ t Opt{p* : a r ,i,t)J <—0 P t(p*,a r:i ,t) = s o {t)dt 

The last inequality holds since Yli j t . 6 jv ft) On(p*, i, oo) < cr p *. This is because the algorithm needs 
to only broadcast page p* for a total of a v * amount of time to satisfy all outstanding requests for page p*. 

Let J p * t k be the request in N a (t) such that ntrahkU* '* k )) * s rnax ^ m ^ ze< ^- We can upper bound the increase 
in $ due to OPT' processing as, 

d 1 / rank(J p * ifc ) \ d y^ rank(J p * ;fc ),s (t) 

dt " e VQ(rank(J p *, fc )) J dt ^ ™ ^ " eQ(rank(J p%A; )) 

Our goal is to show that 4zG(t) + gr^Ki) < -4^G*(i). First we consider the case when the adversary 
uses power at least |]V a (£)|. In this case, the increase in the algorithm's objective can be charged directly to 
the optimal solution, along with any increase in the potential function. 

Lemma 3.2. IfQ(\N a (t)\) < s (t) (equivalently, P*(t) > \N a (t)\) then ±G{t) + ±®(t) < j^G*(t). 

Proof. First we bound the increase in <!>(£) due to OPT's processing. Intuitively, the increase in OPT's 
objective, due to using a lot of power, is large enough to absorb the increase in <J> and the increase in 
algorithm's objective. By increasing <I> now and charging it to OPT, we can later use the decrease in $ 
to pay for increases in the algorithm's objective. Recall that J p *^ is request in N a (t) for page p* that 

maximizes n(r"nk(i * )V ^ et a l^a(*)l = rank(J p » ^) and let 7 = nrw ]t)\) - Notice that a < 1 and 
7 > 1. The function Q is concave. Therefore, Q(r&nk(J p *^)) > aQ(\N a (t)\) by Proposition 12.31 The 
increase in <!>(£) due to OPT's processing can be bounded as follows. 

/ rank(J p ,, fc ) ^ s Q (t) 1 Q(\N a (t))\ 1 Q(\N a (t))\ 7 

( 7T7 T77 \T - _Q! MVaW 7tv rjl vT - " qJV °W7 nnAT twk < - \N a {t)\ 

V e / Q(rank(Jp» ifc j) e Q(rank(J p » i/c )) e aQ(\N a (t)\) e 

The total power used by OPT is at least P* > 7|iV a (£)| since P is convex and the speed OPT uses is 
7<3(|iV a (t)|). Hence, j- t G*{t) > -f\N a (t)\. Recall that gjG(t) = 2\N a (t)\. Knowing that e < \ we have 
that, 

±G(t) + l*(t) < 2|JV.(t)| + JjV.(t)| < ^\N a (t)\ < ~CT(t) 

D 

Due to the previous lemma, for the rest of this paper we can concentrate on the case where Q(\N a (t)\) > 
s {t). We begin by bounding the increase in <3?(i) due to OPT's processing using this assumption, 



/rank(J p * jfc )\ So (t) /rank(J p%fc )\ Q(\N a (t)\) 



Q(rank(J p . ]fc )) V e / Q(rank(J p * jfc )) 



I e ) OfrankfX.. ^1 V 

/ rank( J p * jfc )\ (|iY a (t)|/rank(Jp* ! fc))Q(rank(Jp* i fe)) 



< 



V e J Q(rank(J p » ifc )) 



< -|JV«(t)| (2) 

e 

The first inequality holds since we assumed that Q(\N a (t)\) > s (t). The second inequality follows 
from definition of Q and Proposition 12.31 We can now prove the final case of the running condition. 

Lemma 3.3. lfs {t) < Q(\N a (t)\) andp = ethen ±G{t) + £$(*) < |^G*(i). 

Proo/. We know that ^G(t) = 2\N a (t)\. The increase in $(£) due to OPT's processing is at most \\N a {t)\ 

and the change in <&(£) due to the algorithm's processing is at most — ^ j _ €N , ( t \\ N m jg — ■ Com- 
bining these, we have 

S°W + S* W s ai^-WI + V.WI - E {1 + 6 f- f,) 

Jp,ieK(t)\N (t) 

J P ,i£K(t) J P ,i&N {t) 

< W«)| + iuvi W | - ( fl+f-'V .Mi - ' 1 + ^'-^V .wi) 

< >.(«)! < I^g-w 

The second to last inequality follows from /? = e < i. The last inequality follows since the optimal 
solution's flow time increases by |iV (i)| at time t. □ 



Combing Lemmas (13.2b and (13.31) along with setting f5 to be e gives the running condition, namely 
3F^(*) + aj*(*) - "^3J^*(*)' Thus, we have shown all the properties of $, which gives the following 
theorem. 

Theorem 3.4. The algorithm with (3 = e < ■* is (1 + 6e)-speed -^-competitive for flow plus energy in 
fractional broadcast scheduling. 



By using the reduction from integral to fractional broadcast scheduling in Corollary (12.51 ) and scaling e, 
we have proven Theorem ll.il 

4 Conclusion 

In this paper we initiated the study of energy in the broadcast scheduling model. We showed a scalable 
algorithm for average flow time plus energy. It is important to note that the algorithm BLAPS explicitly 
depends on the speed e. That is, (3 depends on the speed e. Recently, many algorithms developed for 
scheduling have this dependency lPT6ll2"4ll27ll . It would be interesting to find an algorithm which does not 
depend on e or show no such algorithm exists. It would also be of interest to consider objective functions 
that include energy minimization for the broadcast setting. 
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A Reduction of Integral to Fractional Broadcast Scheduling 

In this section we prove Theorem 12 .41 Consider any sequence of requests and let S denote a valid fractional 
broadcast schedule. We now define an algorithm to construct an integral schedule S' . In the integral schedule 
we will use < e < 1 resource augmentation over the schedule S. Recall that in an integral schedule, a 
request J Pt i must receive a p unit sized pieces of page p in sequential order. We will assume that the fractional 
schedule works on at most one page during a unit time slot. We can assume this without loss of generality 
because we can set a unit time slot to be arbitrarily small, since we are assuming preemption. Let f p i denote 
the time request J Pt i is fractionally satisfied in S. Let /i^ denote the time that J Pi i is integrally satisfied 
in S'. Our algorithm and analysis build on the reduction from fractional to integral broadcast scheduling 
given in [HI where broadcasts are always sent at a fixed speed. Since the processor speeds can vary in our 
setting, in our analysis we will have to be careful about how speed is distributed and how processing power 
is accounted for. 

Our algorithm will keep track of a queue Q of tuples. A tuple will be of the form (p, w, b, k). Here p 
corresponds to a page. The value of k G R will correspond to the part of page p that will be broadcasted. 
The variable w G R will be called the width and b € R will be the start time. Each tuple r = {p,w,b, k) 
will correspond to some request J p ^ and the width will be w = f Pi i — a Pi i. This request will determine the 
part of page p that will be broadcasted and the speed of the broadcast. The request corresponding to a tuple 
could be updated. However, after the first piece of the page is broadcasted, the request corresponding to the 
tuple will not be changed. 

At any time when not broadcasting a page, our algorithm will determine a tuple r = (p, w, b, k) to 
broadcast. Let J p ^ correspond to this tuple. Let s(p, i, k) denote the speed used by S in the fcth time slot 
devoted to page p after time a p ^. Notice that a s(p, i, k) volume of page p is broadcasted in this time slot 
under S. When broadcasting the tuple r, we mean that our algorithm is broadcasting the page p with speed 
(1 + e)s(p, i, k). Our algorithm stops broadcasting p once a s(p, i, k) volume of work is completed. Notice 
that this takes -^— time because our algorithm runs at speed (1 + e)s(p, i, k) and S took one time unit to 
broadcast the s(p, i, k) volume of page p. If k = 1 then the algorithm broadcasts p from the beginning 
of the page. Otherwise, our algorithm broadcasts p from where it left off. Besides scaling the speeds, the 
algorithm is the same as that used in [8]. 
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Algorithm: GenRounding(t) 
All requests arrive unmarked 
Simulate the schedule S 
for Any unmarked request J pi completed by S at time t do 

if There is a tuple r = (p, w, b,k) £ Q for page p where 6 > a Pi i then 
Update the width of r to be u; = min{w, / p j — a P) j} 

else 

Insert the tuple (p, (f Pt i — a P j), oo, 1) into Q 

end if 
end for 

Dequeue the tuple r = {p, w, b, k) with minimum width, breaking ties arbitrarily 
Let J q j be the request which gave r the width w 
Broadcast the tuple r 
if This broadcast was the first unit piece of page p. That is, k = 1 then 

Set b = t 
end if 
if This broadcast was the last unit piece of page p. That is, X^=i s (q, J> &) = °p then 

Mark all requests for page p that arrived before time s 
else 

Update r to be (p, w, b, k + 1) 
end if 



It can be observed that our algorithm is online if S is online. First we show that the power used by our 
algorithm is most the power used by S. 

Lemma A.l. Consider any two tuples tj and r^for the same page q broadcast by S'. Let J q j and J q ^ be 
the requests which gave these tuples their width and speed, respectively. Let j > k then a q j > fa k 

Proof. The algorithm S' put the tuple r^ into Q at time /„ j.. Hence, the start time &(rfc) of r^ must be after 
f q> k. Since the algorithm inserted a new tuple for J q j, it must be that 6(ta,) < a q j. Hence, f q ^ < &( r fc) < 
a q ,j. □ 

Consider any unit time slot in S. The previous lemma implies that only one tuple broadcasted in S' will 
correspond to this time slot. This implies the following claim. 

Claim A.2. The power used by S' is at most the power used by S. 

It remains to show that /J^ — a p j < |(/ p ,j — a Pi j) + | for any request J p ^. Fix a request J p ^. If at time 
f Pt i the request J p j is marked then /^ ^ — a p j < |(/ p ,j — a Pj i) + | and we are done. Thus, we assume that 
J Pi i is unmarked at time f P i. Since, J pi is unmarked, there exists a tuple r = (p, w, b, k) in Q at time t 
where w < f p> i — a Pt i. Let tA be the earliest time before time t where every tuple dequeued during [tA, f P ,i] 
has width less than w. Let ts be the latest time after time t where every tuple dequeued during [/ Pj j,£b] 
has width less than w. By definition of the algorithm, at each time during [tA, £b] only tuples with width at 
most w are dequeued. Let f\ denote the set of tuples broadcasted during [tA, *b]- Notice by definition of 
the algorithm, once a tuple is broadcasted, the width of the tuple never changes. 



Claim A. 3. Consider any t£[] and let J q j be the request that gave r its width. Then a q j > tA 



w. 



Proof. Since the tuple r was broadcasted during [tA, is] the width w(t) must be less than w. By definition 
of tA there are no tuples with width less than w in Q at time i^ — 1. Thus, f q j > tA and by definition of 
width, a q j > f q j — w >tA~ w. □ 
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Let N = | 1 1. All N tuples in FJ correspond to some unit time slot where S preformed a broadcast. By 
the previous claim, all of these broadcasts occurred after time tA — w- Further, by definition of the algorithm 
all of these broadcasts occurred before time £#. This is because, a tuple corresponding to a request J q ^ will 
not be inserted into Q until after time f q ^ and all of the requests corresponding to tuples in \\ are satisfied 
in S' by time £#. This implies that the schedule S uses at least N time slots during [tA — w, ts]- Thus we 
have, 

N <t B -t A + w + 2 

Since our algorithm has e resource augmentation, it takes j^— time to broadcast one tuple. Since our 
algorithm is always busy during [tA, ts] we have that 

iV> (l + e)(t B -t A )-3 

This implies that ts — tA < ^±2. For the request J p j we know that a p ^ > tA — w by Claim lA31 Thus, 
tB ~ a P ,i + w< 2£ ± 5 . Knowing that f vi < t B and e < 1, we have (/^ - a Pti ) < ^idJ Combining this 
with Claim |A!21 and the fact that our algorithm is online, we have Theorem [ 
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